New Study: Multi-task Performance of Large Model-driven Robot Vacuums is Poor, Success Rate Only 40%
Andon Labs evaluation shows that top large model robot vacuums have a success rate of only 40% in performing multi-step tasks such as 'delivering butter'. The task involves complex steps like cross-room localization, object recognition, locating moving humans, delivery, and returning to charge, highlighting the limitations of AI in home environments.